----yyyy Report statistics and ranges of estimated times per event.
Without the -y option, perfex reports the counts recorded by the
R10000 event counters for the events requested. As these are simply
raw counts, it is difficult to know by inspection which events are
responsible for significant portions of the job's run time. The -y
option associates an approximate time cost with some of the event
counts.
The reported times are approximate. Due to the superscalar nature
of the R10000, and its ability to hide latency, one cannot state a
precise cost for a single occurrence of many of the events. Cache
misses, for example, can be overlapped with other operations, so
there is a wide range of times possible for any cache miss.
To account for the fact that the cost of many events cannot be known
precisely, perfex -y reports a range of time costs for each event.
""""MMMMaaaaxxxxiiiimmmmuuuummmm,,,,"""" """"mmmmiiiinnnniiiimmmmuuuummmm,,,,"""" and """"ttttyyyyppppiiiiccccaaaallll"""" time costs are reported. Each is
obtained by consulting an internal table which holds the "maximum,"
"minimum," and "typical" costs for each event, and multiplying this
cost by the count for the event. Event costs are usually measured in
terms of machine cycles, and so the cost of an event generally
depends on the clock speed of the processor, which is also reported
in the output.
The "maximum" value contained in the table corresponds to the worst
case cost of a single occurrence of the event. Sometimes this can be
a very pessimistic estimate. For example, the maximum cost for
graduated floating point instructions assumes that all such
instructions are double precision reciprocal square roots, since
that is the most costly R10000 floating point instruction.
Due to the latency-hiding capabilities of the R10000, the "minimum"
cost of virtually any event could be zero since most events can be
overlapped with other operations. To avoid simply reporting minimum
costs of 0, which would be of no practical use, the "minimum" time
reported by perfex -y corresponds to the best case cost of a single